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wherein the processor includes: 

classification allocation means f<k allocating potential classifications to each said 
unclassified example and for generating a plurality of classification sets, each said 
classification set containing said plurality lof training classified examples and said at least 
one unclassified example with its said allocated potential classification; 

assay means for determining a strangeness value valid under the iid assumption for 
each said classification set; 

a comparative device for selecting the classification set to which the most likely 
allocated potential classification for said at east one unclassified example belongs, 
wherein said predicted classification output by the output terminal is said most likely 
allocated classification according to said strangeness values assigned by said assay 
means; and 

a strength of prediction monitoring deWice for determining a confidence value for 
said predicted classification on the basis of said strangeness value assigned by said assay 
means to one of said classification sets to which the second most likely allocated potential 
classification of said at least one unclassified ebcample belongs. 



1 1 . Data classification 
processor further includes 
strangeness values for each s) 




ipparatus as claimed in claim 10, wherein said 
pie valuation device which determines individual 
g classified example and said at least one 



unclassified example having an allocated potential classification. 
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12. Data classification apparatus as claimed in claim 11, wherein Lagrange 
multipliers are used to determine said individual strangeness values. 



13. Data classification a] 
means determines a strangeness value 




s as claimed in claim 1 1 , wherein said assay 
'ach said classification set in dependence on 



said individual strangeness values of eadh said example 



14. Data classification apparatus comprising: 



urality of training classified examples and at least 



i ed and unclassified examples; 
ample classification program; 
a predicted classification for said at least one 



unclassified example and for generating 



an input device for receiving a p 
one unclassified example; 

a memory for storing said classi 
stored programs including an ex; 
an output terminal for outputting 
unclassified example; and 

a processor controlled by said stored programs for identifying the predicted 
classification of said at least one unclassified example, 
wherein said processor includes: 

classification allocation means f<j)r allocating potential classifications to each said 

a plurality of classification sets, each said 
classification set containing said plurality of training classified examples and said at least 
one unclassified example with its alloc; ited potential classification; 

assay means for determining a strangeness value valid under the iid assumption for 
each said classification set; 
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a comparative device for selecting the classification set to which the most likely 
allocated potential classification for said at least lone unclassified example belongs, 
wherein the predicted classification output by sdid output terminal is the most likely 
allocated potential classification according to s^lid strangeness values assigned by said 
assay means and 

a strength of prediction monitoring device for determining a confidence value for 
said predicted classification on the basis of sekd strangeness value assigned by said assay 
means to one of said classification sets to which the second most likely allocated potential 
classification of said at least one unclassified example belongs. 



15. A data classification method 



:ompnsing: 



inputting a plurality of training classified examples and at least one unclassified 
example; 

identifying a predicted classification of said at least one unclassified example 
which includes, 

allocating potential classifications to each said unclassified example; 

generating a plurality of classification sets, each said classification set containing 
said plurality of training classified examples and said at least one unclassified example 
with its allocated potential classification; 

determining a strangeness value vjklid under the iid assumption for each said 
classification set; 
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selecting the said classification set A which the most likely allocated potential 
classification for said at least one unclassified example belongs, wherein said predicted 
classification is the most likely allocated potential classification in dependence on said 
strangeness values; 

determining a confidence value for said predicted classification on the basis of the 
strangeness value assigned to one of said classification sets to which the second most 



likely allocated potential classification for s£ 
and 



id at least one unclassified example belongs; 



outputting said predicted classification for said at least one unclassified example 
and said confidence value for said predicted classification. 



16. A data classification meJlScras claimed in claim 15, further including 
determining individual strangeness valu^gppr each said training classified example and 

said at least one unclassified example having an allocated potential classification 

V 

17. A data classification methoc as claimed in claim 15, wherein said selected 
classification set is selected without the application of any general rules determined from 
the said training set. 



18. 



A data carrier on which is 



data by performing the following steps: 



tored a classification program for classifying 
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under the iid assumption for each said 



generating a plurality of classification sets, each said classification set containing a 
plurality of training classified examples and £t least one unclassified example that has 
been allocated a potential classification; 

determining a strangeness value valid 
classification set; 

selecting the classification set to whicll the most likely allocated potential 
classification for the said at least one unclassified example belongs, wherein the predicted 
classification is the most likely allocated potential classification in dependence on said 
strangeness values; and 

determining a confidence value for said predicted classification on the basis of 
said strangeness value assigned to one of said classification sets to which the second most 
likely allocated potential classification for said at least one unclassified example belongs. 
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